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Abstract 

Abstract: We investigate the probability distribution of the volatility return intervals r for 
the Chinese stock market. We rescale both the probability distribution Pg(r) and the volatil- 
ity return intervals r as Pq{T) = l/r/(r/r) to obtain a uniform scaling curve for different 
thi^eshold value q. The scaling curve can be well fitted by the stretched exponential function 
f{x) ~ e~°^^, which suggests memory exists in r. To demonstrate the memory effect, we 
investigate the conditional probability distribution i-'g(rlro), the mean conditional interval 
{t\tq) and the cumulative probability distribution of the cluster size of r. The results show 
clear- clustering effect. We further investigate the persistence probability distribution P±{t) 
and find that P~{t) decays by a power law with the exponent far different from the value 
0.5 for the random walk, which further confirms long memory exists in r. The scaling and 
long memory effect of r for the Chinese stock market are similar to those obtained from 
the United States and the Japanese financial markets. 
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1 Introduction 



In recent years, physicists have paid much attention on the dynamics of financial 
markets. Scaling behavior is discovered in the financial system by analyzing the 
indices and the stock prices y{t'), such as the 'fat tail' of the probability distribution 
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P{Z, t)of the two point price change return Z{t') = In y(t') — ln — l)[lll|2|]. The 
physical origin of the scaling behavior is often related to the long range correlation. 
It is interested to find, in spite of the absence of the return correlation, the volatility 
|Z(t') I is long range correlated 

Recently, the volatility return intervals r, which is defined as the return intervals 
that the volatility is above a certain threshold o, is investigated for the United States 
and the Japanese financial markets lH, 0, 0, [si M . Scaling behavior of the prob- 
ability distribution in the volatility return intervals r is discovered, and long-range 
autocorrelation is demonstrated for r. The scaling and the long-range autocorrela- 
tion are rather robust independent of the stock markets and the foreign exchange 
markets for the developed countries. However, it is known that the emerging mar- 



kets may behave differently lIllL Il2l 11311. Especially, the Chinese stock market is 



newly set up in 1990 and shares a transiting social and political system. Due to the 
special background of the Chinese stock market, it may share similar properties as 



the mature financial markets lll2ll . however, it may also exhibits special features 



far different from the mature financial markets in some aspects [[11 



as the leverage effect reported in ref. lIllL 11211 . It is important to investigate the 



12L[13|], such 



financial dynamics for the Chinese stock market to achieve more comprehensive 
understanding of the financial markets. 

In this paper, to broaden the understanding of the scaling and memory effect of the 
volatility return intervals r for the emerging markets, we investigate the probabil- 
ity distribution and the memory effect of r for the Chinese stock market. In the 
next section, we present the data set we analyzed. In section 3, we show the prob- 
ability distribution of r. In section 4, we investigate the clustering phenomena by 
analyzing the conditional probability distribution Pq(r|ro), the mean conditional 
interval {t\tq) and the cumulative distribution of the cluster size of r. In section 
5, we investigate the persistence probability distribution P±{t). Finally comes the 
conclusion. 



2 Data Analyzed 



The data we analyzed is based on the trade-by-trade data from the stocks of the 
Shanghai Stock Exchange market(SHSE) and the Shenzhen Stock Exchange mar- 
ket(SZSE). The SHSE was established on November 26, 1990 and put into opera- 
tion on December 19, 1990. Shortly after, the SZSE was established on December 
1, 1990 and put into operation on July 3, 1991. Most A-shares and B-shares are 
traded in the SHSE and SZSE. 

The Chinese stock market is an order-driven market and is based on the so called 
continuous double auction mechanism. In the trading day, there are 3 time periods. 
From 9: 15 to 9:25 a.m., it is the opening call auction time, when the buy and sell or- 
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ders are aggregated to match. From 9:25 to 9:30 a.m., it is the cool period, and then 
followed by the continuous double auction time. The time period for the continu- 
ous double auction is from 9:30 to 1 1:30 a.m. and from 13:00 to 15:00 p.m.. More 
Information about the development process and the current trading mechanism can 
be found in ref. lll3Lll4ill5L ll6ll. We study the transaction records for 3 whole year 
from 2003 to 2006. The number of the 3 whole year transactions is about 800,000 
on average. 



3 The Probability Distribution 



Here we define the volatility return intervals r(g) as the time intervals that volatility 
\Z(t') I above a certain threshold q, where the sampling time interval for the volatil- 
ity is 1 min. Therefore, r(g) depends on the threshold q. Fig. 1 shows the 
volatility return intervals for q = 0.50, q = 1.00 and q = 1.50 in May 2003 of 
the Datang Telecom Co., Ltd(DTT). The big value of q corresponds to the large 
volatility that rarely occurs in the financial markets. We investigate the probabil- 
ity distribution function(PDF) Pq(r) of the volatility return interval r(g) with the 
threshold q = 0.750, 0.875, 1.000, 1.125, 1.375 and 1.500. 
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Fig. 1. Volatility return intervals r for the DTT stock in May 2003 with the threshold values 
q =0.50, 1.00, 1.50 are displayed. 

Fig. 2a shows the PDF Pq{T) for the DTT stock and Fig. 2b shows the PDF Pg(r) 
for the Chinese Minsheng Banking Co., Ltd(CMB). The seven curves are for g=0.750, 
0.875, 1.000, 1.125, 1.375 and 1.500 respectively. The resuks show that the PDF 
Pg(r) for large q decays slower than that for small g. We rescale -Pg('r) and r as 
Pglr) = l/r/(r/r), which is mentioned in ref. B, BBS, §,[13], to collapse the 
seven curves with different threshold q onto a single curve, where r is the average 
interval. 

Fig. 3a and Fig. 3b shows the scaled PDF -Pg(r)r as a function of the scaled volatil- 
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Fig. 2. (a) Probability distribution functions Pg(r) for the DTT stock are displayed in 
log-log scale. The circles, squares, diamonds, triangle ups, triangle lefts, triangle downs 
and triangle rights are for the threshold values q from 0.75 to 1.50 (0.750, 0.875, 1.000, 
1.125, 1.250, 1.375 and 1.500) respectively, (b) Probability distribution functions P<j(r) for 
the CMB stock are displayed in log-log scale. The circles, squares, diamonds, triangle ups, 
triangle lefts, triangle downs and triangle rights are for the threshold values q from 0.75 to 
1.50 (0.750, 0.875, 1.000, 1.125, 1.250, 1.375 and 1.500 respectively. 



ity return intervals r/r for the DTT stock and the CMB stock. Fig. 3c shows the 
scaled PDF Pg(r)r for the DTT stock, the CMB stock, the CITIC Securities Co., 
Ltd. (CITIC) stock and the Bird Telecom Co., Ltd(BDT) stock. 

The scaling behavior is observed for the normalized r. The scaling function / (r/r) 
does not directly depend on the threshold q but through r = r(g). The scaling be- 
havior is similar to that obtained from the United States and the Japanese stock 
markets, i.e., the generality of the scaling is further confirmed for both the mature 
financial markets and the Chinese stock market. It helps us to overcome the diffi- 
culty to perform statistics for the rare event with big price fluctuation. If we know 
the Pg(r) with a small q, the behavior of r with large q then can be predicted by the 
scaling function. We fit the scaled PDF with a stretched exponential function form 



We find that the DTT stock and the CMB stock have similar exponent value with 
(7, a) = (0.20±0.05,4.0±0.5). The exponent is close to the value (7,0) = (0.38± 
0.05,3.9 ± 0.5) of the United States stock market [6]. The function form is far 
different from the Poission distribution, which indicates there may exist correlation 
in r. 
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Fig. 3. (a) Scaling of the volatility return intervals for the DTT stock with the threshold 
values q ranging from 0.75 to 1.50 (0.750, 0.875, 1.000, 1.125, 1.250, 1.375 and 1.500) 
are displayed with circles, squares, triangle ups, triangle downs, pluses, crosses and stars 
in log-log scale, (b) Scaling of the volatility return intervals for the CMB stock with the 
threshold values q ranging from 0.75 to 1.50 (0.750, 0.875, 1.000, 1.125, 1.250, 1.375 and 
1.500) ai^e displayed with circles, squares, triangle ups, triangle downs, pluses, crosses and 
stars in log-log scale, (c) Scaling of the volatility return intervals for the DTT stock, the 
CMB stock, the CITIC stock and the BDT stock with the threshold value q = 1.0 are 
displayed with circles, squares, crosses and stars in log-log scale. 

4 Clustering Phenomena of the VolatiUty Return Intervals 



If clustering effect occurs in time serials, it suggests memory exists in those seri- 
als. To demonstrate the memory in the volatility return intervals r, we investigate 
the clustering effect by study the conditional probability distribution Pq(r|ro), the 
mean conditional interval {t\tq) and the cumulative probability distribution of the 
cluster size of r. 



4. 1 Conditional Probability Distribution 



To investigate the memory effect of the volatility return intervals r, we analyze 
the conditional PDF Pq(r|ro). The Pq(r|ro) is denoted as the probability distribu- 
tion function of the r that immediately follow a given volatility return interval tq 
. If memory exists in r, the Py(r|ro) should depend on the preced- 
ing volatility return interval tq. To achieve good statistics with more data points, 
we sort the volatility return intervals in increasing direction and divide it into two 
subsets. Fig. 4 shows the scaled conditional PDF Pg(r|ro)/r for the DTT stock 
and the CMB stock with g=0.750, 0.875, 1.000, 1.125, 1.250, 1.375 and 1.500. 
The Pg(r |ro) /r for different threshold q are collapsed onto a single curve. It is ob- 
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served that the Pg(r|ro) /r for the lower subset is higher for the small tq while the 
Pqi'^l^o) for the larger subset is higher for the large tq, i.e., small r follows small 
To and large r follows large tq. It implies that the intervals with the similar size form 
clusters, i.e., there exists memory in r. The clustering effect has been investigated 
in ref. [SlGilTilsllS, 10], we obtain the similar result for the Chinese stock market. To 
guide the eye, we fit the data with the stretched exponential function f{x) ~ e~"^^ . 
For the DTT stock, the exponent is measured to be (7, a) = (0.30, 4.00) for the 
Lower 1/2 subset and (0.20,4.00) for the larger 1/2 subset. For the CMB stock, 
the exponent is measured to be (7, a) = (0.30, 4.0) for the Lower 1/2 subset and 
(0.20,3.50) for the larger 1/2 subset. 



(a) 



DTT stock 



q=0.750 
q=0.875 
q= 1.000 
q=1.125 
q= 1.250 
q=1.375 
q=1.500 



(b) 



CMB stock 



t/t 



t/t 



Fig. 4. (a) The scaled conditional probability distribution Pq(r|To)/r vs t/t for the DTT 
stock with the threshold values q ranging from 0.75 to 1.50 (0.750, 0.875, 1.000, 1.125, 
1.250, 1.375 and 1.500) are displayed with circles, squares, diamonds, triangle ups, tri- 
angle lefts, triangle downs and triangle rights. The closed symbols are for the lower 1/2 
subset, and the open symbols are for the larger 1/2 subset. The dashed lines are for guiding 
the eyes and with a stretched exponential form f{x) ~ e~"^^, where (7, a) = (0.30, 4.00) 
for the lower 1/2 subset and (0.20,4.00) for the larger 1/2 subset respectively, (b) The scaled 
conditional probability distribution Pg(r|ro)/r vs t/t for the CMB stock with the thresh- 
old values q ranging from 0.75 to 1.50 (0.750, 0.875, 1.000, 1.125, 1.250, 1.375 and 1.500) 
ai^e displayed with circles, squares, diamonds, triangle ups, triangle lefts, triangle downs 
and triangle rights. The closed symbols ai^e for the lower 1/2 subset, and the open symbols 
are for the larger 1/2 subset. The dashed lines are for guiding the eyes and with a stretched 
exponential form f{x) ~ e~"^ , where (7, a) = (0.30, 4.00) for the lower 1/2 subset and 
(0.20,3.50) for the larger 1/2 subset respectively 



4.2 Mean Conditional Interval 



To further demonstrate the memory effect of the volatility return intervals, we in- 
vestigate the mean conditional return interval {t\tq), which is defined as the mean 
of the volatility return intervals r that immediately follow a given tq subset. Fig. 5 
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shows the scaled mean conditional return interval {t\tq) /t for the DTT stock and 
the CMB stock with g=0.75, 1.00, 1.25. The closed symbols are for the volatility 
return intervals and the open symbols are for the shuffled data respectively. It is 
found that the shuffled data (t\tq) jr almost keeps a constant, i.e., there is no corre- 
lation in the shuffled data. However, the clustering phenomena that small r follows 
small To while large r follows large tq are observed for the volatility return intervals 
r. It further demonstrates the clustering effect of the intervals for the Chinese stock 
market. Therefore, it implies autocorrelation in r. The result is similar to that of the 
United States and the Tananese stock markets ll5[[^|7|.[8ll. 




Fig. 5. (a) The scaled mean conditional return interval (rjro) /r vs tq/t for the DTT stock 
with the thi^eshold values q=0.15, 1.00, 1.25 are displayed with circles, squares and tri- 
angles. The closed symbols are for the volatility return intervals r and the open symbols 
are for the shuffled data, (b) The scaled mean conditional return interval (r|ro) /r vs tq/t 
for the CMB stock with the threshold values g=0.75, 1.00, 1.25 are displayed with circles, 
squai^es and triangles. The closed symbols are for the volatility retuin intervals r and the 
open symbols are for the shuffled data. 

4.3 Cluster Size Distribution of the Volatility Return Intervals 

To investigate the clustering phenomena in a more direct way, we study the cumu- 
lative probability distribution of the cluster size of r. The cluster size is obtained 
by calculating the successive intervals with similar size [SiTilsiQ]. We separate the 
data into two sets by the median data of r. The data which are above (below) the 
median data is signed by '+' ('—')• Accordingly, n consecutive or '— ' intervals 
form a cluster and the corresponding cumulative probability distribution is denoted 
as Pn+i^) (Pn-(T))- Fig. 6 shows the cumulative probability distribution Pn±(r) 
for the DTT stock and the CMB stock with g=0.75, 1.00 and 1.25. The open sym- 
bols are for the Pn+i^) and the closed symbols are for the j5„-(r). We find that the 
Pn-{T) presents longer tail persisting up to about n = 25 than the p„+('r) does, 
i.e., the relative small value of r may form big clusters. Similar clusters have been 
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found in the United States and the Japanese stock markets [@, 0, Isl Isl] . It also indi- 
cates long memory exists in r. 
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Fig. 6. (a) Cumulative distribution of the cluster size of r for the DTT stock with the 
thi^eshold values g=0.75, 1.00, 1.25 are displayed with circles, squares and triangles. The 
open (closed) symbols are for the consecutive volatility return intervals that are all above 
(below) the median interval records, (b) Cumulative distribution of the cluster size of r 
for the CMB stock with the threshold values g=0.75, 1.00, 1.25 are displayed with circles, 
squares and triangles. The open (closed) symbols are for the consecutive volatility return 
intervals that are all above (below) the median interval records. 



5 Persistence Probability 



To achieve deeper understanding of the memory effect of the volatility return in- 
tervals, we investigate the persistence probability, which has been systematically 
studied in nonequilibriurn dynamics such as phase ordering dynamics and critical 
dynamics [|l8Lll9Ll20l l2lL l22ll. The idea of persistence is closely related to the first 
pass age time which has been widely studied in physics, biology and engineering 
[I23L I24I I25L I26L \2l\ . In general, the persistence probability provides additional in- 
formation to the autocorrelation. 



The persistence probability P+{t) (P_{t)) is defined as the probability that r(t' + f) 
has always been above (below) r(t') in time t, i.e., r(t' + t)> T(t') {rit' + t)< 
T{t')) for all t < t. The average is taken over t'. In Fig. 7, the persistence probability 
distribution of r is plotted in log-log scale with g=0.75, 1.00 and 1.25. It is found 
that P4. (t) decays much faster than P„ (r) does. The P_ (t) is observed to decay by a 
power-law P-{t) ~ The high-low asymmetry of the persistence probability is 
similar to that of the volatility [19, 2^, 21, 22]. The persistence exponents measured 
from the slopes of P_(t) with g=0.75, 1.00 and 1.25 are close and estimated to be 
f3 = 0.25 ± 0.05 for the DTT stock, and (3 = 0.86 ± 0.05 for the CMB stock. The 
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exponents are far different from those of the random walk, where both Pj^ (t) and 
P-{t) show a power law behavior with a persistence exponent P = 0.50. It further 
supports that lona-ranae correlation exists in r. 




Fig. 7. (a) Persistence probability of the volatility return intervals r for the DTT stock with 
the threshold values q=0J5, 1.00, 1.25 ai^e displayed with circles, squares and triangles in 
log-log scale. The open (closed) symbols ai^e for p^{t) {p+{t)). The solid line(the dotted 
line) is the p~{t) ip+{t)) for the random walk, (b) Persistence probability of the volatility 
return interval r for the CMB stock with the threshold values 5=0.75, 1.00, 1.25 are dis- 
played with circles, squares and triangles in log-log scale. The open (closed) symbols are 
forp_(t) (p+(t)). 



6 Conclusion 



In summary, we have investigated the probability distribution function Pq(r) of the 
volatility return intervals r for the Chinese stock market. Scaling behavior is ob- 
served after Pg(r) and r are rescaled as Pg(r) = l/r/(r/r). The scaling curve 
can be fitted by a stretched exponential function /(x) ~ e~"^'^ with a = 0.39 
and 7 = 4.00, which is far different from the Poission distribution. It suggests 
there exists memory in r. We then study the conditional probability distribution 
Pg(r|ro) and the mean conditional return interval {t\tq). The results show that both 
the P(j{t\tq) and the {t\tq) depend on the previous volatility return intervals tq. 
To obtain the clustering phenomena in a more direct way, we investigate the cu- 
mulative probability distribution of the cluster size of r. Clear clustering effect is 
observed, especially for the relative small value of r. We further investigate the 
persistence probability distribution of r. It is found that P_ (t) decays by a power 
law, with the exponent f3 = 0.25 ± 0.05 for the DTT stock and /3 = 0.86 ± 0.05 
for the CMB stock, which is far different from the value 0.5 for the random walk. 
The Pq(r|ro), the {t\to) and the cumulative probability distribution of the cluster 
size of r for the Chinese stock market are similar to those obtained from the United 
States and the Japanese stock markets. The persistence probability further confirms 
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the long memory of r for the Chinese stock market. Compared with the mature fi- 
nancial markets, we find that, as a emerging market, the Chinese stock market may 
have some unique features, however, it shares the similar scaling and long memory 
properties for the volatility return intervals as the United States and the Japanese 
stock markets. 
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